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Fractional generalizations of Young and Brunn-Minkowski 

inequalities 

Sergey Bobkov, Mokshay Madiman, and Liyao Wang 

Abstract. A generalization of Young's inequality for convolution with sharp 
constant is conjectured for scenarios where more than two functions are being 
convolved, and it is proven for certain parameter ranges. The conjecture would 
provide a unified proof of recent entropy power inequalities of Barron and 
Madiman, as well as of a (conjectured) generalization of the Brunn-Minkowski 
inequality. It is shown that the generalized Brunn-Minkowski conjecture is 
true for convex sets; an application of this to the law of large numbers for 
random sets is described. 



1. Introduction 

Let us denote by the Banach space L^iW^^dx) of measurable functions 
defined on R" whose p-th power is integrable with respect to Lebesgue measure dx. 
In 1912, Young 46 introduced the fundamental inequality 

(1.1) ||/*5llr- < i + i-- + l, l<p,<z,r<+oo, 

p q r 

for functions f & and g € L'', which implies that if two functions are in (possibly 
different) L^-spaces, then their convolution is contained in a third L^'-space. In 
1972, Leindler |30j showed the so-called reverse Young inequality, referring to the 
fact that the inequality (jl.ip is reversed when < p,q,r < 1. 

For a long time, identification of the best constant that can be put on the 
right side of ()1.1|) was an open problem. Eventually, Beckner [9] proved Young's 
inequality with the best possible constant. To specify the best constant, first define 
Cp by 

(1-2) C'p^ 
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where, for any p £ (0, oo], p' is defined by 

. ^ 1 1 

1.3 - + 

p p' 



Note that p' is positive for p G (1, ex:), and negative for p g (0, 1). Then the best 
constant in Young's inequahty is {CpCq/Cr)^ . Soon after, Brascamp and Lieb |18| 
gave alternative proofs of both Young's inequahty and the reverse Young inequahty 
with this sharp constant; a simpler and unified proof (of the direct and reverse 
inequalities) using transportation arguments was given by Barthe [7] . Very recently, 
an even simpler proof using entropy inequalities was given by Cordero-Erausquin 
and Ledoux |21j . 

Let us remark in passing that a much more general family of inequalities can be 
proved [Ullll]; these are now known as the Brascamp-Lieb inequalities. An opti- 
mal transportation proof of these was given by Barthe [6j , while an entropy-based 
proof has recently been given by Carlen and Cordero-Erausquin |20j (cf. Lehec 
[29]). Even more general inequalities are shown using a heat flow interpolation 
technique by Bennett, Carbery, Christ and Tao [121 lllj (see also Valdimarsson 
[431 [44]). In another direction. Young's inequality can be extended to more gen- 
eral settings than M"- specifically, to unimodular locally compact groups (see, e.g., 
Fournier [25| , Quek and Yap [36] , Saeki [37] , Baklouti, Smaoui and Ludwig |4] and 
references therein). 

There were several motivations for exploring Young's inequality with sharp 
constant- such as the fact that the optimal constant in the related Hausdorff- 
Young inequality (which turns out to be related to the same Cp) gives the definitive 
formulation of the entropic uncertainty principle, which is a fundamental result in 
quantum mechanics. In spite of the fact that the sharp constant is only very 
slightly better than 1 for large parameter ranges, the slight improvement makes all 
the difference for such applications. 

Quite separately from the functional analytic study of L^'-norm inequalities, 
other mathematical communities were developing inequalities that would later be 
seen to be related. Indeed, Brunn, Minkowski and Lusternik (cf. [39] for the his- 
tory) developed the famous inequality for volumes of Minkowski sums that bears 
their names; this saw enormous development over the following decades, and be- 
came a cornerstone of convex geometry and analysis, apart from finding numerous 
applications in a vast variety of fields. In a completely independent development, 
Shannon [40j proposed the so-called "entropy power inequality" for entropies of 
sums of independent random variables taking values in some Euclidean space, which 
was later rigorously proved by Stam [41) . This inequality in its own way became 
fundamental in information theory, emerging as a key tool in proving the so-called 
converse coding theorems that show the fundamental limits of various data com- 
pression or communication models. Subsequently it was noted by several authors 
that one or both of these inequalities are related to Young's inequality with sharp 
constant; indeed proofs of the Brunn-Minkowski and entropy power inequalities 
based on Young's inequality were given by Brascamp-Lieb [19] and Lieb [31] re- 
spectively (see also Dembo, Cover, Thomas [22]). 

Given the history and importance of the results described above, there is clear 
intrinsic interest in exploring refinements of them, and in particular of Young's in- 
equality with sharp constant. While clearly it is impossible to refine this inequality 
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in the sense of improving the constant, what we explore in this note is the refine- 
ment of it when one is looking at the convolution of more than two Junctions. It 
turns out that in this case, qualitatively different phenomena appear that have fas- 
cinating connections to random set theory and recent developments in information 
theory and probability. 

Since we wish to consider M > 2 functions, let us write [M] = {1, 2, ... , M}. 
Consider a hypergraph Q on [M]. Recall that a hypergraph is just a collection of 
subsets of [M]. 

Our starting point is the following (unpublished) conjecture made by the second- 
named author some years ago; the anonymous referee thought it might have been 
discussed before but we have been unable to find a reference. 



Conjecture 1.1. Let Q be a d-regular hypergraph on [M]. Let {ps : s G t?} and r 

be real numbers in (l,oo) such that 

1 d 



(1.4) 



see 



= \Q\- 



Let fj,j e [M] be probability density functions on M". Then 



(1.5) 



[M] f] 



< 



1 



n 



'"PS 



PS 



Furthermore the inequality is reversed when {ps : s G ^} U {r} C (0, 1). 

We now outline the main results and organization of this note. In Section [21 
Conjecture [TTl] is proven for certain parameter ranges. 

In Section [3J a conjecture about a generalized Brunn- Minkowski inequality for 
the Minkowski sum of more than 2 sets is formulated; it is shown that the conjecture 
certainly holds for convex sets. It is also shown in Section [3] that Conjecture 11.11 
implies the conjectured generalized Brunn-Minkowski inequality for general Borel 
sets. 

In Section m an application of these generalized Brunn-Minkowski inequalities 
to the law of large numbers for random sets is described, after quickly reviewing 
necessary notions from the theory of random sets. 

Section [5] discusses recent generalized entropy power inequalities of [33j . which 
gives evidence towards Coni ecture 11.11 since the former is shown to be a special case 
of the latter. 

Finally, in Section[Sl we make some remarks on the sharpness of Coniecture ll.ll - 
in particular, on the question of when extremizers exist. 

2. A special case 

It is appropriate to recall some terminology from discrete mathematics. A 
collection Q of subsets of \M\ is called a hypergraph, and each set s in C/ is called 
a hyperedge. When each hyperedge has cardinality 2, then Q can be thought of as 
the set of edges of an undirected graph on m labelled vertices. We interchangeably 
use "hypergraph" and "collection" for Q, "hyperedge" and "set" for s in Q, and 
"vertex" and "index" for i in [M]. 

The following definitions are standard. 



Definition 2.1. 

g : iet}\. 



For any index i in [M], define the degree of i m Q as r(i) = |{t G 
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The collection Q is said to be d-regular if each index i in [M] has the same 
degree d, i.e., if each vertex i appears in exactly d hyperedges of Q. 

The following definition extends the familiar notion of a partition of a set by 
allowing fractional counts. The origin of this notion is unclear to us, but see |38j . 

Definition 2.2. Given a collection Q of subsets of [M], a function 7 : ^ [0, 1], 
is called a fractional partition, if for each i £ [M\, we have X^sgc/ igs 7s = 1. 

The following simple lemmas are useful. 

Lemma 2.3 (Fractional Additivity). Let {0^:16 [M]} be an arbitrary col- 
lection of real numbers. For any s C [M], define as = Sjgs "^i- ^''^ ^"■2/ fractional 
partition 7 using any hypergraph Q, a^M] = X^see Ts^^s- 
Proof. Interchanging sums implies 

^ 7s 51 H H 7sl{ies} = 
sgS ies ie[i\i] see 

□ 



ie[M] 



If the hypergraph Q is d-regular, then 

^ d ^ d 

see,S9i see 

which motivates the following definition. 

Definition 2.4. If Q is d-regular, as = ^ defines a fractional partition of [M] using 
t?, which we call the degree partition. 

The following slight extension of Holder's inequality is useful. We adopt the 
notation /s for H^es fj- 

Lemma 2.5 (Fractional Holder inequality) . Letfj,j G [M] be measurable functions 
on R". Let ^ be a fractional partition using the hypergraph Q , and qs be coefficients 
such that 

V — = - 



(2.1) 

Then we have 
(2.2) 



see 



9s 



ii/[M]ii.< nii/«ii?i- 



see 



Proof. Recall that Holder's inequality says 

n < n ii/^iip- 

ie[M] 1 iG[M] 

if J2i ~ = ~- (This is traditionally stated with 9 = 1, but it is easy to deduce the 
form above from that.) Hence, for any fractional partition 7 using 



n 

je[M] 



see ^ jes 



IS 



< 



n n/. 

see jes 



7S 



PS 



where X^see ^/P^ ~ 1/^- -^^^^ ll/s^llps ~ II/sIIt-Ips' ^° ^^^^ obtain the result by 
setting qs — 7sPs to satisfy the constraint (|2.ip . □ 
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In particular, if be a d-regular hypergraph and the coefficients qs satisfy 



see 9s 



we liave 



n 



< 



see 



Combining this elementary observation with the Hausdorff- Young inequality, Con- 
jecture [TTl] follows for a subset of possible parameters. 



Theorem 2.6. Conjecture \l.l\ holds when r > 2 and ps G [1,2] for each s in Q. 

Proof. The proof uses the sharp Hausdorff- Young inequality (also called the 
Babenko-Beckner inequality). The latter states that if / G for p € [1,2], and / 
defined by f{x) = J e'^'^^^^'^^ f{y)dy is its Fourier transform, then 



(2.3) 
Indeed, 



[M] fj 



\\p' <c;\\fi. 



„ T-r - „ 

< ^"'11 n Ml 

(b) r 

see jes 



T-r 

< n 

see 



CI 



PS II ■'Jllps 



where (a) and (c) follow from the Hausdorff- Young inequality, and (b) follows by 



the fractional Holder inequality since (|1.4p implies that J2 
that Cr' — C^^ for r > 1 completes the proof. 



see 



Observing 
□ 



Unfortunately the subset of parameters ps, t covered by Theorem l2.6l is not the 
most interesting subset, at least for the applications we have in mind. 



3. Brunn-Minkowski Inequalities 

Below we always use jifj to denote volume (Lebesgue measure) of a Borel subset 
K of Euclidean space of some fixed dimension n. Let -|- denote the Minkowski sum 
whenever the addition operation is applied to sets. Then the classical Brunn- 
Minkowski inequality states that for any nonempty Borel sets ifi, . . . , Km in M", 



K 



> 



First we propose the following extended Brunn-Minkowski inequality. 

Conjecture 3.1. Let Ki, . . . ,Km he nonempty Borel sets in M". Then for any 
fractional partition j3 using the collection Q of subsets of [M], 



K 



M 



see jes 
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Indeed, observe that this speciahzes to the usual Brunn-Minkowski inequality 
when one takes Q to be the set of singletons, and each Ps — I- 

To see the relationship between Young-type and Brunn-Minkowski-type in- 
equalities, it is useful to define the notion of Renyi entropy, a one-parameter family 
of entropy-like quantities. For any random vector X in R" with density /, and any 
p > 1, the Renyi entropy of X of order p: 



where 



hpiX) = 



ll/l 



P 



p-1 



log- 



1 



Fdx 



i/p 



is the usual L^-norm with respect to Lebesgue measure on R". The definition of 
hp{X) continues to make sense for p 6 (0, 1) even though ||/||p is then not a norm. 
There remain the values p = 0, 1, oo on the non-negative half line; for these values, 
hp{X) may be defined "by continuity". Specifically, as p — > 1, hp{X) reduces to 
the Shannon differential entropy 

(3.1) h{X) ^ hi{X) = - f{x)logfix)dx, 

and as p — !> 0, hp{X) reduces to 

/io(X)-log|Supp(/)|, 

where Supp(/) is the support of the density / (i.e., the closure of the set {a; 6 M" : 
fix) > 0}). 

One may also define the Renyi entropy power of X of order p: 



(3.2) 



Vp{X)=exp^ -hp{X) 



This reduces to the Shannon entropy power for p = 1, and reduces for p = to 
(3.3) V^{X) = |Supp(/)|2/". 

Proposition 3.2. If Conjecture [ITTl is true, then Conjecture \3.1\ is true. 

Proof. The proof we give is an extension of that used by Dembo, Cover 
and Thomas [22] to show that the reverse Young inequality with sharp constant 
implies the usual Brunn-Minkowski inequality, and involves taking the limit in an 
appropriate reformulation of Conjecture [TTT] as r — >■ from above. 

Let Xi be random vectors in with densities fi respectively. The reverse part 
of Conjecture II . II asserts that for any r £ (0, 1) and ps G (0, 1), 



- c" n 

^ see 



C, 



PS 



*jes fj 



PS 
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Taking the logarithm and rewriting the definition p.2p of the Renyi entropy power 

as Vp{X) = ll/llp^^ we have 



(3.4) 



ie[M] 



^l0gK-( J2 ^0 <"logCr-5El0gC', 



PS 



see 
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see 



ies 



It is useful to introduce two discrete probability measures A and k defined on 
the hypergraph ^, with probabilities proportional to l/pg and 1/ps respectively. 
Let us set L,. = r\Q\ — (r — l)d = r{\Q\ — d/r'); then the condition (|1.4p . allows us 
to write explicitly 



(3.5) 

and 

(3.6) 



Ks = ^ — , seG, 

.LrJ PS 



A, 



by also using 1/ps + 1/Ps = 1 for the latter. Then, setting Is = X^ies"'^*' P-^P 
reduces to 

log Vr{Y^M] ) > r' log a' - 7 E log ^Ps + E log Vp^ (Ys). 



see 



see 



We wish to write this only in terms of d, r, and A, so that we can take A to be 
fixed and control all other parameters by tuning r as desired. Towards that end, 
note that 



r' log = - log |r' I H log 



and 



see 



see 



logps logbs 



PS 



P's 



= ^ As log IPsI - E ^°^P^ + E logps, 
see see see 

using the definitions p.6p and (|1.3p of As and pg . Thus one obtains 



log Vr (r[M] ) > E log Vp^iYs) + - log 



(3.7) 



see 



E '^^s 

see 



r 
P's\ 



E(As 

see 



logps- 



The third of the four terms on the right side of ^3.7\i simplifies as 



E 



see 



\P'S 



i7(A)-logd, 
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(dXs) ^ by p.6p . where we use H{X) = — J2seg log As to denote 



since IPsl/k'l 

the discrete entropy of the distribution A. Also, the fourth term simplifies as 



^ ( As - - ) logps 
sea 



d{l 



■E 

see 



Ks 



, 1 , ^ 

log 1- log — 



d{l-r) 

where the first equality follows from the fact that 



H{n)+\og — 



As-^ = 



r 

dps 



r' Lj-Ks LrKs 



d dps dr d{l — r) 

by successive use of (|3.6I) , (I3.5P and (|1.3|) , and from the relation between ps and ks 
in (|3.5p . With these simplifications (|3.7p can be rewritten as 

1 



log Vr {Y[M] ) > E log Vp^ {Ys) - 
see 

+ iJ(A) -logd 



(1-r 

Lr 



■log) 



(3.8) 



H{k) + log — 



J2 As log Vp^ {Ys) + log r 



see 



i?(A) -logd 



d(l - r) 



[HiK)-logLr]. 



d{l - r) 

These computations hold for any r and any {ps}, or equivalently, for any r 
and any A. Let us fix A; thus one can think of the coefficients ps now as functions 
of r. We now choose to send r J, in (|3.8I) . Then Lr — > d, and {Lr — d) \ogr = 
(|^?| — d)r log r 0. Furthermore, from the definitions (I3.5P and (I3.6P of k and A, 



^ = A«^^fl-4) = 



Ac 



1 



Ps 



A«V 



1 - 



dAc 



AsV+ (1 -r)d 



Lr 



1, 



which gives by continuity of the discrete entropy that H{k) — > H{X). Thus, in the 
limit as r J, 0, the inequality (|3.8p becomes 

(3.9) 



logFo(y[M]) >J2XslogVo{Ys) +2[H{X) - logd] 



see 



If Supp(/i) = Ki, then Supp(*igs/i) = J2ies which we may denote by Ks\ 
so (13. 9p simplifies using (|3.3I) to 



logd 



- log \K[M] \>y^Xs- log lifsl + H{X) - logd = V As log ^ 
n — ' n ^ — ' As 

see see 



The right side is clearly maximized by choosing As proportional to \Ks \ " , in which 
case we obtain 



log |i^[M]|" > log^ |/fs|" ~ logd, 



see 



which is precisely the desired result for d-regular hypergraphs Q equipped with the 
degree partition. In fact, assuming the truth of Conjecture 11.11 we have proved 
that Conjecture 13. II is true for all regular multihypergraphs (i.e., collections of sets 
in which a given set may appear multiple times with different labels, and we keep 
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track of the labels in checking regularity). The desired result then follows by a 
bootstrapping argument. □ 



Remark 3.3. To finish the proof, we used the fact that Conjecture 13.11 follows 
from its specialization to c?-regular multihypergraphs G equipped with the degree 
partition. While such a bootstrapping capability appears to be folklore in the 
combinatorics literature, a proof can be found, e.g., in [341 Proposition 1]. The key 
point is that the set of all fractional partitions (when viewed as points in the non- 
negative orthant of ) is a convex, compact set; so linear inequalities hold for 
every fractional partition if they hold for every extreme point of the set of fractional 
partitions. Furthermore, it can be shown that all these extreme points have rational 
coordinates, and thus can be viewed as degree partitions corresponding to certain 
regular multihypergraphs. 

Remark 3.4. In fact, one can state the following fractional formulation of Conjec- 
ture [Ll] for any fractional partition /3 using the hypergraph Q on [A/], any density 
functions {fj,j G [M]}, and numbers {ps '■ s £ Q} and r satisfying 



(3.10) 

we have 
(3.11) 



sea 



Ps 



sea 



1 



[M] fj 



< 



PS 



/3s 



when {ps : s e Cy} U {r} C (1, oo), and the reverse inequality when {ps : s g t/} U 
{r} C (0, 1). Not surprisingly, this formulation would directly yield Proposition 132] 
via the limiting argument outlined above. (However, although this formulation 
appears more general than Coniecture ll.il they are actually equivalent in keeping 
with the previous remark.) 

Remark 3.5. Observe that both the Young and reverse Young inequalities can 
be compactly expressed in the form p.8|) . which holds with the same sign for all 
positive Ps and r. 



For the special case of convex sets, it is easy to see that Conjecture 13. H is true. 
The proof relies on a simple lemma. 

Lemma 3.6. For nonempty convex sets A and B , one has the distributive identities 

{a + b)A = aA + bA and a{A + B) = aA + aB, 

for any non-negative real numbers a and b, whereas these do not hold for general 
sets. 

Theorem 3.7. Let Ki, . . . , Km be nonempty convex sets in M". Then for any 
fractional partition /3 using the collection Q of subsets of [M], 



(3.12) 



\Ki + ... + Km\^ >J2I3sJ2^- 



see 



If the sets Kj are homothetic, one has equality. 
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Proof. By Lemma [3761 for any fractional partition, 

Ki + ... + Km = J2(3sY,Kj. 

seg jes 

Applying the usual Brunn-Minkowski inequality gives 



see jes 



SGS j&s 



The equality conditions for the Brunn-Minkowski inequality for convex sets require 
that the sets be homothetic (i.e., equal upto translation and dilatation). Thus we 
find that one has equality in (|3.12[) if and only if the sets 

J6S 



are homothetic. This is certainly satisfied if the sets Kj are homothetic. 



□ 



Let us note in passing that a different kind of refinement of the Brunn-Minkowski 
inequality for convex bodies that captures the "stability" of the characterization 
of extremizers (homothetic convex bodies) has been recently developed (see, e.g.. 

It is interesting to consider adaptations of Theorem 13. 71 to Gaussian measures. 
In this context, it is useful to recall the current understanding of Brunn-Minkowski- 
type inequalities for Gaussian measure. The first step towards such an inequality 
was implicit in Borell's study of log-concave measures |;14j; in particular, the fact 
that log-concave measures are characterized by log-concave densities implies that 
for Borel sets K.^ C M", and any A G [0, 1], 

(3.13) 7(AXi + (1 - A)if2) > 7(^i)S(i^2)''\ 

where 7 is the standard Gaussian measure on M" . Unlike in the case of Lebesgue 
measure, however, this log-concavity of measure does not imply the Gaussian 
isoperimetric inequality, proved independently by Sudakov and Tsirelson [42j and 
Borell [15j (cf . also (l3j ) . The latter inequality asserts that halfspaces are extremal 
in that they have smallest boundary 7-measure among all sets of given 7-measure. 
A satisfactory strengthening of p.l3p . which implies Gaussian isoperimetry, was 
first obtained by Ehrhard [23j. In its most general formulation, due to Borell |16j . 
it asserts that for Borel sets Ki C M" of positive volume, and any A S [0, 1], 

(3.14) $-! o 7(AXi + (1 - X)K2) > A$"i o j{Ki) + (1 - A)*^^ o 7(^4:2), 

where $ is the cumulative distribution function of the one-dimensional standard 
normal. (This was proved earlier in |23| for closed, convex sets, and by Latala |28| 
when one of the sets is Borel and the other convex.) The inequality (|3.14p has 
been further generalized by Borell (17j (cf. Barthe and Huet [8j and Gardner and 
Zvavitch [26] ). where non-convex combinations are also considered. 

By an argument very similar to that used in proving Theorem 13. 71 we immedi- 
ately obtain the following version for Gaussian measure. 

Theorem 3.8. Let Ki, . . . ,Km be convex sets of positive volume in M". Suppose 
P is any fractional partition using the collection Q of subsets of [M], and that the 
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coefficients Xj > satisfy with X]je[A/] '^j ~ ^- Then we have 

(3.15) ^-'ojfY^ X,K,) >J2PsXs ojfj2^KX 
^jelM] ^ see ^3^^^ ^ 

where As = Eies'^*- 

Proof. Note that 

ie[M] see ies ® 

and Esee '^^As 1. Then apply (|3J4)) . □ 

Note that the assumption of positive volume (or equivalently positive 7- measure) 
in Theorem 13.81 can be removed, provided we adopt the convention cxd — cxd = 
—00 + 00 = —00. For example, if one of the sets is the empty set, then one should 
interpret any Minkowski sum of the empty set with any other sets as the empty 
set, which would make the right side equal to ~cx) and the inequality trivially true. 
It is natural to conjecture that p.lSp continues to hold for all Borel sets. 

4. Applications to random sets 

4.1. Random sets. In order to develop the application of Theorem 13.71 to 
the theory of random sets, let us first outline some basic features of that theory. 
We follow the exposition of Molchanov |35j , which the reader can consult for more 
details. 

A random closed set is a random element in the space T of all closed subsets 
(including the empty set 0) of the basic setting space E = R". To describe the 
corresponding probability measures, one needs to specify a topology and cr-algebra 
on T . For A C M" , introduce sub-classes of T by 

The "hit-or-miss" topology Tjr on the class T is the topology generated by collec- 
tions of sets of the form 

^gi,...,gm = ^ -^Gi n n . . . n Tgm i 

where K runs over the class /C of compact sets in K" , and Gi , . . . , Gm lie in the 
class of open sets in R". It is a classical fact that the topological space (J-", Tjr) is 
compact, Hausdorff and separable. 

A sequence of closed sets Fm, Af > 1, converges in Tjr to a certain closed set 
F if and only if both the following conditions are valid: 

(1) li K \^ F — (f) ior a, certain compact K , then KHFm = (f> for all sufficiently 
large M; 

(2) ii GnF ^ (j) for a certain open set G, then G n Fm ^ for all sufficiently 
large M. 

We then write Fm — F. 

Suppose IC is the class of compact subsets of R", and let Tjc be the topology on 
JC induced by Tf. To ensure the convergence of a sequence Km, M > 1, of compact 
sets in /C an additional condition is required: there exists a compact K' such that 
Km C K' for aU M > 1. We then write Km -^k K. 



12 



BOBKOV, MADIMAN, AND WANG 



The convergence of compact sets in K. can be metrized by means of the Haus- 
dorff metric pn on /C. The Hausdorff distance between two compacts Ki and K2 
is defined as 

Ph{K^,K2) = inf{e > : C K^, K2 C Kf}, 

where if = K + eB is the e-envelope of K, and B denotes the closed ball of unit 
radius centered at 0. The Hausdorff distance between two closed sets is defined 
similarly; however, it can be infinite. 

A random closed set is an J-"- valued random element, measurable with respect 
to the Borel c-algebra ctjf generated by Tjr on J^. Examples of random closed sets 
include random points and point processes, random spheres and balls, random half- 
spaces and hyperplanes etc. The distribution of a random closed set A is described 
by the corresponding probability measure P on ct_f, and hence on sets of the type 
■^Gi Gm- Fortunately, P is determined also by its values on for K running 
through /C only. In fact, the capacity functional of A is defined by 

Ta{K) = P{A e Tk} ^ P{AnK^q}} 

for K ^ IC. The properties of T resemble those of the distribution function. 
Recall that the support function sa of a set A is defined by 

sa{u) = sup (u, x) 

for any u G M". Note that if X is a random closed set, then its Lebesgue measure 
or volume \X\, its norm 

||X||=sup{||a;|! 

and its extent in a given direction Sx{u) are usual real- valued random variables. 
Also, 1 1 AT 1 1 < 00 almost surely if and only if X is compact. 

Define C to be the class of convex closed sets in M". A random closed set is 
said to be convex if its realizations are almost surely convex, i.e., if A belongs to 
C almost surely. Similarly, a random compact, convex set is a random closed set 
whose realizations lie almost surely in C fl /C. 

4.2. Law of large numbers for random sets. To formulate a law of large 
numbers, we first need a notion of expectation for a random set. Aumann [3] 
developed such a notion, which was used extensively in the theory of set-valued 
functions and related optimization problems; later Artstein and Vitale ^ pioneered 
its use in the context of random set theory. 

We now define the Aumann expectation of a random compact set A. A random 
vector <^ in M" (jointly distributed with A on the same probability space) is said to 
be a selector of A if E A with probability one. The expectation of A is defined to 
be the set 

EA — {ES^ : ^is a selector of A, i?^ exists}. 

The condition < cxd is enough to determine that EA is nonempty and com- 

pact. It follows from Aumann [3j that, provided the underlying probability measure 
is non-atomic, EA — Econv{A) and hence EA is convex even for non-convex A. 
In this case, the expectation EA can also be defined as the convex set having the 
support function 

sea{u) = Esa{u) , ueS"-^ 
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this definition continues to make sense for unbounded random sets. 

Tlie foiiowing tlieorem is due to Vitaie [45] . and may be considered a Brunn- 
Minkowsfci inequality for random sets. 

Theorem 4.1. If A is a random compact set with E\\A\\ < oo, then 

\EA\i > E\A\i. 

Artstein and Vitaie [2] developed a law of large numbers for random sets. Their 
approach first reduces the general problem to the case of random compact convex 
sets, and then proves the result for random compact convex sets by invoking an 
appropriate result in the Banach space C(§"~^) and applying it to the support 
functions of random sets. 



Theorem 4.2. Let A, Ai, A2, . . 

E\\A\\ < 00. Then 



be a sequence of i.i.d. random compact sets with 



1 

- V A, -^K. EA 



n 



a.s. as M 



The stage is now set for us to state and prove a monotonicity property in the 
law of large numbers for random sets. 

Proposition 4.3. Let A^ Ai, A2, . . . be a sequence of i.i.d. random compact sets 
with E\\A\\ < 00. If Conjecture \3. 1\ is true, then 



E 



M 



— Ya, 



i=l 



is a non- decreasing sequence in M . 



Since the validity of Conjecture 13.11 is known for convex sets, the statement of 
Proposition 14. 31 is also valid for convex sets. In fact, more is true, but first we need 
to state a classical result (see, e.g.. Beer |10j ). 

Proposition 4.4. Suppose {Ki,i G N) C Cn/C, i.e., each Ki is a compact, convex 
set in M". If ph[Km,K) as M 00. and K eCfMC, then \Km\ \K\. 



Theorem 4.5. Let K,Ki,K2,. 
sets with E\\K\\ < 00. Then 



be a sequence of i.i.d. random compact convex 



E 



1 

M ^ 

1=1 



\EK\ 



M 



In other words, the mean effective radius of the empirical mean based on M obser- 
vations of the random convex set A is a monotonically non- decreasing sequence (in 
M) that converges to the effective radius of the Aumann expectation of K . 

Proof. Consider the hypergraph Qm-i of leave-one-out subsets of [M], i.e., 
Gm-1 = {s C [M] : |s| — M — 1}. This is a d-regular hypergraph with degree 
d = M — 1, so Theorem 13.71 implies that 



Km\ 



> 



1 



M 



T S 

SGC/Af-l 
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Equivalently, 
(4.1) 

Setting 



K 



M 



M 



> 



M ^ 



M - 1 



L 



Km 



M 



and noting that each of the M summands on the right side of (|4.ip has the same 
law as that of Lm~i, we find that 

E[\Lm\"] 

is non-decreasing in M. 

By Theorem 14.11 -E-il-^A/l"] ^ \ELm\"- The i.i.d property and the hnearity of 
the Aumann expectation yield ELm = EK. So i?[|LM| "] will tend to a finite limit 
which is not larger than |i?iir|". On the other hand, since Lm — >j(c EK almost 
surely by Theorem 14. 21 and due to the continuity of the volume functional on /CnC 
asserted by Proposition 14.41 it follows that the limit of \Lm\ exists almost surely, 
and moreover that 



lini 



\L 



\EK \ " a.s. 



Then, by Fatou's lemma, one has 

lim £;[|Lm|"1 > £^[lim inf |La/|"] = £;[ lim \Lm\^ = \EK\^ a.s.. 



M->oo 



Af-i-OO 



- Af^oo 



which is the desired lower bound. Combining the bounds yields VrnvM- 
\EK\'^ , and completes the proof. 



.E\\L 



M\ 



□ 



5. Entropy power inequalities 

We comment here on the connections of Coniecture ll.il with a recently proved 
class of so-called entropy power inequalities. 

For a R"-valued random vector X with density / with respect to the Lebesgue 
measure on K", the entropy (sometimes called differential entropy or Boltzniann- 
Shannon entropy) is given by (|3.1I) . and the (Shannon) entropy power of X is 
M{X) = limit ourselves to random vectors X with h{X) < +00; in 

this case, Af{X) is a non-negative real number. 

Building on work of j33) and resolving a conjecture they made, [34] recently 
showed the following result. 

Theorem 5.1. Let Xi, . . . , Xm be independent W^-valued random vectors, such 
that the entropy of each exists and is finite. Let j3 he a fractional partition using a 
collection Q of subsets of [M] . Then 

M{X^ + ...+ Xm) >YP^^{Y. ^i) ■ 
see ^jes ^ 

Equality holds if all the Xi are normal with proportional covariance matrices. 
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Let us briefly mention some specializations of Tlieorem l5.ll If G is an arbitrary 
hypergraph on [M], |33j showed that 

(5.1) A/-(Xi + ...+Xm)>^^AA(^X,), 

seg jes 

where d is the maximum number of hyperedges in Q in which any one vertex appears 
(and in particular for d-regular hypergraphs). Choosing Q to be the class Gm-i of 
all sets of M — 1 elements yields d — M — 1 and hence 

(5.2) AA(Xi + ... + Xm)> E -^l E ■ 

This inequality was proved by Artstein, Ball, Barthe and Naor 1 , and was used 
by them to affirmatively resolve the long-standing conjecture of monotonicity in 
Barron's entropic central limit theorem [5^. Choosing Q to be the class Qi of all 
singletons in (j5.ip yields d = 1 and hence 

(5.3) A/'(Xi + . . . + Xm) > J2 -^(^j)' 

je[M] 

which is the classical Shannon-Stam entropy power inequality [401 I41j . This is 
already a nontrivial and interesting inequality, implying (as implicitly contained 
in [41j ) for instance the logarithmic Sobolev inequality for the Gaussian usually 
attributed to Gross |27| . 

Theorem 15.11 is related to Conjecture 11.11 indeed the former follows from the 
latter and thus provides some evidence towards the validity of Coniecture ll.il The 
proof of this implication is very similar to that of Proposition [321 except that one 
takes the limit r ^ 1 instead of r — > in the form (|3.8|) of Conjecture 11.11 

6. Remarks on the sharpness of Conjecture 11.11 

Consider the following simple case of Conjecture 11.11 (we only consider the 
generalization of Young's inequality, although similar comments can be made about 
reverse Young), corresponding to n = 1, M = 3 and d — 2. 11^ + ^ + ^ = 3 — p-, 
then 



ll/l*M/3||r < a'V^C^Q ||/l*/2||p^ IIM/alll llM/lll?. 

Given that Young's inequality with sharp constant is (of course!) sharp, and that 
equality can only be attained for Gaussians, it is natural to expect that a similar 
fact holds for Coniecture ll.il However, it turns out that this is not quite the case. 

Take fi to be the density of the non-degenerate normal distribution N{fii, af) 
with mean /i^ and variance af, and plug them into the above inequality to get 

,j_ fr'\^ f r'\^ f r' 
(6.1) x'^fy^t (2 — x — j/)4* ^ I — 



where 



P' J kq' J \t 



and y 



a\ + a2^ + oz^ " erf + cra^ -f (732 ■ 

Note that (x,y) lies in the region {(a;, y) G R^jx < 1, y < 1, a; + y > 1}. Simple 
calculus shows that if 

(6.2) r' < min{p', q', i'}. 
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then ()6.1|) is sharp. On the other hand, if the condition ()6.2p is violated, then the 
right side of ()6.ip still bounds the left side from above, but it is not the best bound 
for the function on the left side. In the rest of this section, we make some remarks 
that attempt to shed light on this observation, which is somewhat unexpected in 
view of the fact that consequences of Conjecture 11.11 such as Conjecture 13.11 and 
Theorem 15.11 are clearly tight (for homothetic convex bodies and Gaussians with 
proportional covariance matrices respectively). 

Let us first examine the way in which Conjecture 11.11 implies Conjecture 13.11 
and Theorem 15. II The strategy was to let r go to some limit (either or 1), while 
keeping the coefficients A constant. This yielded a limit inequality for any fixed A, 
which was then optimized over A to obtain the desired conclusion. Furthermore, 
in both the Brunn-Minkowski and entropy power contexts, the optimal choice of A 
happens to be such that each As is always bounded from above by \/d (or in other 
words, r' < min{pg : s ^ Q}, which is condition (|6.2p for the general case). Thus 
the source of the looseness appears to lie in the fact that there is an optimization 
of the inequality that has not been performed. 

Note that the optimal choice of A in the preceding discussion depends on the 
functions fi. This suggests that it may be interesting to consider the following 
problem: Fix all the functions fi, as well as the parameter r and the d-regular 
hypergraph Q, in Coniccture 11.11 Assuming that the conjecture is true, what are 
the best constants {ps\s G G} such that the inequality will hold? (In other words, 
what is the optimized form of the conjectured inequality without taking a limit in 
r?) Furthermore, does such an optimization always yield a tight bound on the left 
side of the conjectured inequality, which is achieved for Gaussians? 

While we are not able to completely answer these questions, we give some 
indications. Using the reformulation (|3.8p of Conjecture 11.11 in terms of Renyi 
entropy powers, our goal is now to maximize the right side of p.8|) over choice of 
A (which determines {ps\s G G} and k), for fixed functions. The following simple 
lemma is useful. 

Lemma 6.1. Define ff{p) ~ log \ f\^dii, where ^ is any measure on the measure 
space X. Let E = {0 < p < oo\ipf{p) < co}. Then 

(1) E is a convex set. 

(2) On E, ff{p) is a convex function in p. 

(3) On E, ipf{p) is a continuous function. 

(4) In the interior of E, ipf (p) is infinitely differentiable. 

(5) In the interior of E, ffij)) — p '^'^^jf^ is a non-increasing function of p. 

Proof. The first two parts are classical- indeed, the second is Lyapunov's 
inequality. Continuity of ^f{p) on the interior of E follows from its convexity. 
Also, E must be an interval since it is convex- if it includes an endpoint, use 
dominated convergence to show that it is left (respectively, right) continuous at the 
right (respectively, left) endpoint. 

For part (4), suppose {pi,P2) is a subset of the interior of E, so that p2 + e ^ E 
and pi ~ e G E for some e > 0. Let p,q & iPi,P2), with p fixed, and q ^ p. 
Note that on {|/| > 0}, ^'^^^Z^p^ = log(l/l)l/l^j where ^ is between p and q. On 
the set {0 < I/I sc: 1}, bound |log|/|| by Afi|/r% |/|« by \fr and 
by Ml|/|Pl"^ On the set {|/| ^ 1}, bound |log|/|| by Ahlfl^ |/|« by and 
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\ \j_p I by M2\f\^^~^'^- Now use dominated convergence to get the desired result. 
Similarly for higher order derivatives. 

For part (5), note that in the interior of E, the derivative of ipf{p) —p-^i^ is 
simply —ptp'j:{p), which is smaller than or equal to zero due to part (2). So it is a 
non-increasing function in the interior of E. □ 

We can now apply the Lagrange multiplier method to obtain a necessary con- 
dition for optimal coefficients {ps\s & G}- (It is not known to be sufficient since the 
objective function does not appear to be concave.) In the following proposition, we 
adopt the notation /| = *igs/i, and use the fact that for any density /, the quan- 
tity ff{p) ~ P~^i^ from Lemma 16.11 can also be written in terms of the entropy 
of the new density function -jj^j^- 

Proposition 6.2. Suppose fi are densities on M" such that ||/s||p is finite for all 
p e (0, -l-oo) and each s £ Q. Then if the set of nonnegative real values {ps\s G G} 
maximizes the right side of (13.81) . there must exist a constant /3 G K such that the 
stationary conditions 

log 

and 



1 - Ps 



9 

PS 



2h{Fs 



(3 for all s E G 



E 

sea 



Ps r' 



hold, where 



Fs - 



PS 



The equations above do not seem to be explicitly solvable in general. However, 
when each fj is a centered non-degenerate Gaussian with covariance matrix Kj , the 
system of equations above becomes explicitly solvable. Moreover, if one substitutes 
these values of ps into the right side of (|3.8p , one obtain the inequality 



det^( U^EdetME^. 
^j6[A/] ^ seg ^jes 

by tedious but entirely elementary calculations. Observe that this is a special case 
of both Theorem lS.ll fapplied to Gaussians) and Theorem l3. 71 (applied to ellipsoids), 
and that it is tight- in particular, it holds with equality if the covariance matrices 
Kj are proportional. 

Acknowledgments. We are grateful to Professor Richard Vitale for help with 
references, and an anonymous referee for pointing out a number of typos and helping 
to improve exposition. 
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